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A mass of traces of human activities show rich dynamic patterns. In this article, we comprehen¬ 
sively investigate the dynamic patterns of 50 thousands of researchers’ activities in Sciencenet , the 
largest multi-disciplinary academic community in China. Through statistical analyses, we found 
that (i) there exists a power-law scaling between the frequency of visits to an academic forum and 
the number of corresponding visitors, with the exponent being about 1.33; (ii) the expansion pro¬ 
cess of academic forums obeys the Heaps’ law, namely the number of distinct visited forums to the 
number of visits grows in a power-law form with exponent being about 0.54; (iii) the probability 
distributions of time intervals and the number of visits taken to revisit the same academic forum 
both follow power-laws, indicating the existence of memory effect in academic forum activities. On 
the basis of these empirical results, we propose a dynamic model that incorporates the exploration, 
preferential return with memory effect, which can well reproduce the observed scaling laws. 

PACS numbers: 89.75.Da, 89.65.-s, 02.50.-r 


I. INTRODUCTION 


II. EMPIRICAL RESULTS 


It has been deemed that complicated factors are affect¬ 
ing the dynamic patterns of human activities, such as the 
priority of task 1 |4j, individual interest (5KZI? memory 
effects (Mil , deadline effects 0 , social contacts 115 
18], and so on. Relevant practical applications range 
from information spreading j!9H22j , decision making [23j 
to advertising [24], |25| and recommendation [261428^ . 

Although large amounts of empirical results on human 
dynamics have been reported in various fields [HI, l29l - [32| , 
researchers’ activities in academic forums are rarely in¬ 
vestigated and still not clearly understood as they usually 
accompanies with many endogenous and exogenous fac¬ 
tors, including the individual preference and professional 
background, the quality of a forum, the content of a post, 
and so on. To fill this gap, we study a data set sampled 
from Sciencenet (http://www.sciencenet.cn/) that con¬ 
tains the academic forum activities of many Chinese re¬ 
searchers. We observe novel dynamic patterns character¬ 
ized by the following statistical features: (i) the power- 
law relation between the frequency of visits to an aca¬ 
demic forum and the number of corresponding visitors; 
(ii) the Heaps’ law [HJ in the expansion process; (iii) 
the memory effect. We further propose a dynamic model 
that well reproduces the empirical observations. 
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Sciencenet is the largest multi-disciplinary academic 
community in China, which contains a blog system, a 
bulletin board system (BBS, consisting of 60 academic 
forums), and a virtual social network of researchers. Our 
data set keeps track of activities in the BBS between 
October/1/2007 and July/7/2011, composed of 366,524 
records from 49,578 researchers. Each record includes re¬ 
searcher ID, academic forum ID, posting/reviewing topic 
ID, and timestamp with resolution of minute. Table [T] 
presents the names, the visiting frequencies (i.e, total 
visits from all researchers), and the number of visitors of 
the 60 academic forums, ranked in the descending order 
of visiting frequencies. 


Rank 

Forum Name 

Frequency 

^visitors 

1 

Mathematics 

28,218 

5,849 

2 

Materials Science 

23,610 

6,293 

3 

Geology and Geophysics 

21,363 

4,025 

4 

Oceanography 

21,265 

4,019 

5 

Reading 

18,586 

5,315 

6 

Research Experience 

15,757 

6,713 

7 

Tea Break 

13,351 

4,209 

8 

Fund Application 

11,672 

4,045 

9 

Article 

11,511 

4,983 

10 

Analytical Chemistry 

9,228 

2,769 

11 

High-polymer Chemical 

8,271 

2,536 

12 

N anotechnology 

8,211 

2,815 

13 

Amorphous Alloy and Glass 

8,148 

1,151 

14 

Computer Science 

7,834 

2,709 

15 

Chemical Engineering 

7,541 

2,208 

16 

Catalysts 

7,541 

2,208 

17 

Organic 

6,952 

2,210 
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18 

Physics 

6,702 

2,547 

19 

Chemistry 

6,653 

2,860 

20 

Physical Resources 

6,289 

2,402 

21 

References 

6,102 

3,115 

22 

Study Abroad 

6,085 

2,781 

23 

Optics 

5,984 

1,759 

24 

Molecular and Cellular 
Immunology 

5,544 

2,132 

25 

Electronic Power 

5,539 

1,841 

26 

Mechanics 

5,306 

1,736 

27 

Control of Intelligent 

Modeling 

5,044 

1,763 

28 

Electrochemistry 

4,961 

1,453 

29 

Crystal 

4,775 

1,881 

30 

Mechanical 

4,772 

1,483 

31 

Theoretical Physics 

4,732 

1,804 

32 

Communication 

4,691 

1,600 

33 

Civil Engineering 

4,164 

905 

34 

Botany and Zoology 

3,973 

1,577 

35 

Clinical Medicine 

3,149 

1,021 

36 

Condensed Matter 

3,130 

1,195 

37 

Examination 

2,849 

1,460 

38 

Resources and Environment 

2,648 

1,271 

39 

Fine Chemicals 

2,464 

772 

40 

Materials and Methods 

2,452 

1,397 

41 

Pharmaceutical Chemistry 

2,357 

677 

42 

Chinese Medicine 

2,170 

548 

43 

History of Science 

2,142 

991 

44 

Engineering 

2,040 

1,080 

45 

Management 

1,920 

978 

46 

Academic Exhibition 

1,920 

978 

47 

GPS/RS/GIS 

1,862 

757 

48 

Genetics 

1,776 

769 

49 

Agronomy 

1,776 

769 

50 

General Chemistry 

1,692 

824 

51 

Astrophysics 

1,565 

509 

52 

Biophysical and 

Bioinformatics 

1,485 

711 

53 

Molecular Simulation 

1312 

571 

54 

Energy Science 

1,233 

546 

55 

Brain Science 

1,161 

553 

56 

Petroleum Exploration 

1,089 

450 

57 

Others 

631 

500 

58 

Cryptography 

625 

251 

59 

Alloy 

427 

176 

60 

Medical and Health 

Management 

275 

196 


TABLE I: Basic information of the 60 academic forums. 


At the aggregated level, as shown in Table U both the 
number of visits and the number of visitors are hetero¬ 
geneously distributed among forums. While at the indi¬ 
vidual level, the visiting behavior is also heterogeneous, 
indicated by the burstiness that a visitor usually stays 
in a forum for long time and then glances over several 
forums. Figure [l] illustrates the transition process of two 
example visitors, with visiting sequences shown in the 
first rows of Fig.[]Jc) and[]Jd). Figure [2] shows the tran¬ 
sition process of a typical real visitor, with two different 



FIG. 1: (Color online) An illustration of concepts in view of 
two visitors’ activities, (a)-(b) The transition process among 
different academic forums (black solid and red dash lines rep¬ 
resent visitor #1 and visitor #2, respectively), (c)-(d) The 
illustration of visiting sequences, as well as other relevant pa¬ 
rameters of the two visitors, including the number of distinct 
academic forums S(n ), the real time t, the number of visits 
n, the exploration of new academic forums AS'(n), the real 
time interval r and the click time interval An. 


scales: (a) real time and (b) click time. The colored ver¬ 
tical lines represent different academic forums. 


(a) 

(b) 
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FIG. 2: (Color online) A typical visitor’s transition process on 
academic forums along with (a) real time and (b) click time, 
suggesting a heterogeneous dynamic pattern that commonly 
exists in other online human dynamics [32[. The colored ver¬ 
tical line denotes the different academic forums. 


As shown in Fig. [3j there is a superlinear correlation 
between the number of visits to a academic forum (de¬ 
noted by F ) and the number of corresponding visitors 
(denoted by P), as F ~ P 7 with 7 « 1.33. This su¬ 
perlinear relationship maybe resulted from two reasons. 
Firstly, the forum that attracts more visitors is usually 
of higher quality, namely a post in this forum can at¬ 
tract a higher fraction of visitors in average. Secondly, 
many visits are induced by some previously comments 
and replies j&jj, in particular, visitors often care much 
about the replies to their own posts and comments. Such 
social cascading process may lead to a superlinear grow¬ 
ing trend since the maximum possible volume of social 
interaction is of the order P 2 . 

The expansion process of a visitor can be described 
by the number of distinct visited forums S(n), where 
n is the number of visits of this visitor. Taking visi- 
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FIG. 3: (Color online) A power-law relation between the fre¬ 
quency of visits to an academic forum and the number of cor¬ 
responding visitors. Each data point stands for an academic 
forum. 



FIG. 4: (Color online) (a) The probability distribution of the 
number of visits N for all visitors, (b) The scaling behav¬ 
ior in the expansion process. The black dash line indicates 
a power-law, S ~ n a , with scaling exponent a ~ 0.54. The 
red squares, purple stars and green diamonds represent three 
groups of visitors with more than 20, 50 and 100 visits, re¬ 
spectively. 


tor #1 in Fig. 0] as an example, his visiting sequence 
is {A,B,B,A,C,D,A,A,A,A}, and the corresponding S(n) 
sequence is {1,2,2, 2, 3,4,4,4,4,4}, as shown in the sec¬ 
ond row of Fig. [He). The activity level is a key feature 
of an user (HI, which may affect the expansion process. 
Here we use the total number of visits JV* to quantify the 
activity level of user Ui. As shown in Fig. 0(a), the dis¬ 
tribution of users’ activity levels is very heterogeneous, 
following a power law with exponent about 2.02, in ac¬ 
cordance with the empirical results in other online ac¬ 
tivities |32|. In common sense, users with different ac¬ 
tivity levels may behave differently, hence we pick out 
three groups of visitors whose visits are more than 20, 50 
and 100 times, respectively. Their expansion sequence 
S(n) in respect to the number of visits n is presented in 


Fig. [4(b), suggesting a robust scaling behavior S ~ n a , 
with the power-law exponent a ~ 0.54, similar to some 
observations in human mobility [35|, IHj . 

To see the preference to visit a new forum, we look at 
the exploration sequence A S(ri) = S(n + 1) — S(n). Sim¬ 
ple examples about the calculation of A S(ri) are shown 
in Fig. [1(c) and 0(d). As shown in Fig. [5(a), the rescaled 
exploration ^s(n)) scales in a power-law form with the 

number of visits, as (A5(nj) ~ n ^i where (3 ~ 0.50 and 
the rescaling factor (A S(n)) is averaged over all visitors 
in the considered group. The result confirms the scaling 
behavior found in expansion process since a + /3 « 1. 
Given the target visitor, if we denote P new the probabil¬ 
ity to visit a new forum and 1 — P new the probability to 
return to a previously visited forum, then the above re¬ 
sult suggests a scaling relation P new ~ pn ~^, where p is 
a constant. Figure [5(b) presents the distributions of p for 
the three groups of visitors, which are bell-shaped curves 
with mean values as (p) ~ 0.41, (p) ~ 0.36 and (p) ~ 0.32 
for N > 20, N > 50 and N > 100, respectively. 



FIG. 5: (Color online) (a) The scaling behavior of rescaled 
exploration (AS(n)) in respect to the number of visits n. (b) 
The probability distributions of the values of p for individual 
visitors. The definition of groups is same to that of figure [4] 


Memory is a very significant feature of human activi¬ 
ties 0 , l7 Hl3l IHI . [HI, which can be characterized by the 
probability distribution of the real time interval r be¬ 
tween two consecutive visits to the same forum or the 
number of visits An taken to revisit the same forum. For 
example, An for the forum B in the visiting sequence of 
visitor 7^2 are 6 and 3, as shown in Fig. 0(d). More ex¬ 
amples for both r and An can be found in Fig. 0(c) and 
0(d). Figure [6] reports the probability distributions of r 
and An of the three groups of visitors. Both P(r) and 
P(An) can be well fitted by power laws with exponents 
being 1.88 and 1.89, respectively. Such distributions sug¬ 
gest the existence of the memory effect since a visitor has 
higher probability to return to recently visited forums. 


III. MODEL 

The above-mentioned empirical results provide us in¬ 
sights to the dynamic patterns of visiting to academic 
forums. In particular, the memory effect assigns higher 
probability to a visitor to return to the forums being 
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FIG. 6 : (Color online) The memory effect in visiting activities, 
suggested by the power-law distributions of real time interval 
r and (b) the number of visits An. The guiding curve follows 
a power-law form, r -7 and A n~^ with 7 « 1.88 and £ ~ 
1.89, obtained by the maximum likelihood method [38]. The 
definition of groups is same to that of figure [U 


visited recently. Accordingly, we propose a model incor¬ 
porating three generic ingredients: (i) the exploration, 
indicating the tendency to visit a new forum; (ii) the 
preferential return, indicating the tendency to visit a pre¬ 
viously visited forum, with frequently visited ones being 
more attractive; (iii) the memory effect, indicating the 
tendency to revisit a recently visited forum. Notice that, 
the preferential return can be mathematically considered 
as a kind of memory with infinite memory length, how¬ 
ever, we distinguish it from memory effect since the pref¬ 
erential return may be resulted from the quality of a fo¬ 
rum and the match between the forum’s content and the 
visitor’s professional background. 

Figure [TJa) illustrates a schematic for the individual 
dynamic model. More specifically, after the nth visit, 
the target visitor has two choices for the next visit: (i) to 
visit a new academic forum with probability P new that 
depends on the number of previous visits n; (ii) to re¬ 
visit one of the S(n) previously visited forums with the 
complementary probability (1 — P ne w )• If the visitor vis¬ 
its a new forum at the (n + l)th visit, the expansion 
sequence updates as S(n + 1) = S(n) + 1, otherwise it 
keeps 5(n-|-l) = S(n). Initially, 5(1) = 1, and the 
probability P new evolves as 

Pnew OC =pn~ 13 . (1) 

an 

In our model, the parameter /3 = 0.50 is obtained from 
the empirical scaling exponent for the case TV > 20 (see 
Fig. [5fa)), while the parameter (p) = 0.41 is estimated 
from the mean value of p for the case N > 20 (see 
Fig. [5Kb)). 

When considering the return to a visited forum, we 
combine preferential return and memory effect into a uni¬ 
fied probability. The preferential return indicates that 
every visit to a forum will contribute to the probability 
to revisit this forum, while the memory effect suggests a 
power-law decay of the contribution strength. Given a 
forum i, if it appears fi times in the target visitor’s vis¬ 
iting sequence, we denote the discrete time interval be¬ 
tween the j th appearance to the current time step (n +1) 



10 ° 10 ’ 10 2 

An 


FIG. 7: (Color online) (a) The schematic of the individual 
dynamic model, (b) The scaling behaviors of expansion S(n) 
in respect to n for empirical data and the model, (c) The 
power-law distribution of return interval An obtained from 
the model. The parameters are set as N = 100, (p) = 0.41, 
P = 0.50, £ = 1.89, and the number of visitor is 10 3 , namely 
all the results can be considered as being averaged over 10 3 
independent runs. 


as 

A rJp = n + 1 - T^\ (2) 

where is the very time step when i just appears j 
times. As discussed above, every visit will be counted but 
the contribution decays with the length of time interval, 
so the probability to visit the forum i is 

E+ (A++ 

n i =- V . J ._ v (3) 

E Ef=i (a+) 

he r+ v ' 

where T u is the set of forums being visited by the target 
user u , and the decaying factor £ « 1.89 is estimated 
from Fig. [6Kb). The validation of the individual dynamic 
model is demonstrated in Fig.[7Kb) and [Tfc), from which 
one can see that the numerical results consist well with 
the empirical scaling behaviors for both expansion and 
revisit. 


IV. CONCLUSION 

In this paper, we comprehensively investigate the dy¬ 
namic patterns of researchers’ academic forum activities 
in Sciencenet. We show a power-law scaling between 
the frequency of visits to an academic forums and the 
number of corresponding visitors, which is similar to the 
allometric scaling law found in biology systems ©El- 
Meanwhile, at the individual level, the number of distinct 
visited forums S(ri) increases with the number of visits n 
in a power-law behavior, obeying the well-known Heaps’ 
Law [4l|, [42|, which is also similar to the previous studies 
on portal browsing activity and human mobility [35|, |36j . 
The memory effect in academic forum activity is unveiled 
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by the power-law distributions of the real time interval 
r and the number of visits An taken to revisit the same 
academic forum. 

Inspired by these empirical results and a previous theo¬ 
retical model jsBj, we propose a dynamic model, incorpo¬ 
rating with the exploration of new academic forums and 
the preferential return with memory effect. Through ex¬ 
tensively experimental testing, the numerical results from 
this dynamic model agree well with the empirical obser¬ 
vations, suggesting its validation. We have also checked 
that the lack of each of the three ingredients will lead to 
huge deviation from the real statistics. 
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